Building a Community Cyberinfrastructure to Support Marine Microbial Ecology Metagenomics Invited Talk  Metagenomics 2006 Calit2 @ UCSD La Jolla, CA October 4, 2006 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor,  Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD
Challenge: Average Throughput of NASA Data Products to End User is < 50 Mbps Tested October 2005 http://ensight.eos.nasa.gov/Missions/icesat/index.shtml Internet2 Backbone is 10,000 Mbps! Throughput is < 0.5% to End User
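The percentage on this slide follows directly from the two quoted rates. A minimal sketch of that arithmetic is below; the 100 GB dataset size and the function names are illustrative assumptions, not figures from the talk.

```python
# Rates quoted on the slide, in megabits per second.
END_USER_MBPS = 50        # upper bound on measured end-user throughput
BACKBONE_MBPS = 10_000    # Internet2 backbone rate


def utilization(end_user_mbps: float, backbone_mbps: float) -> float:
    """Fraction of the backbone rate that actually reaches the end user."""
    return end_user_mbps / backbone_mbps


def transfer_hours(size_gigabytes: float, rate_mbps: float) -> float:
    """Hours needed to move a dataset at a sustained rate (decimal units)."""
    bits = size_gigabytes * 8e9            # 1 GB = 8e9 bits
    seconds = bits / (rate_mbps * 1e6)     # 1 Mbps = 1e6 bits per second
    return seconds / 3600


if __name__ == "__main__":
    share = utilization(END_USER_MBPS, BACKBONE_MBPS)
    print(f"End user sees {share:.1%} of the backbone rate")

    # Hypothetical 100 GB dataset, chosen only to illustrate the gap.
    for rate in (END_USER_MBPS, BACKBONE_MBPS):
        print(f"100 GB at {rate} Mbps: {transfer_hours(100, rate):.2f} h")
```

Under these assumptions the same dataset takes hours at the end-user rate and on the order of a minute at the backbone rate, which is the gap the slide highlights.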
Dedicated Optical Channels Make High Performance Cyberinfrastructure Possible Parallel Lambdas are Driving Optical Networking The Way Parallel Processors Drove 1990s Computing (WDM) Source: Steve Wallach, Chiaro Networks “Lambdas”
National Lambda Rail (NLR) and TeraGrid Provide Cyberinfrastructure Backbone for U.S. Researchers San Francisco Pittsburgh Cleveland San Diego Los Angeles Portland Seattle Pensacola Baton Rouge Houston San Antonio Las Cruces / El Paso Phoenix New York City Washington, DC Raleigh Jacksonville Dallas Tulsa Atlanta Kansas City Denver Ogden / Salt Lake City Boise Albuquerque UC-TeraGrid UIC/NW-Starlight Chicago International Collaborators NLR 4 x 10Gb Lambdas Initially Capable of 40 x 10Gb wavelengths at Buildout NSF’s TeraGrid Has 4 x 10Gb Lambda Backbone Links Two Dozen State and Regional Optical Networks DOE, NSF, & NASA Using NLR
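The lambda counts on this slide translate into aggregate capacity by simple multiplication. The sketch below assumes each lambda carries 10 Gb/s, as the slide states; the constant and function names are illustrative only.

```python
# Per-wavelength rate quoted on the slide, in gigabits per second.
LAMBDA_GBPS = 10


def aggregate_gbps(lambda_count: int, per_lambda_gbps: int = LAMBDA_GBPS) -> int:
    """Total capacity of a set of parallel lambdas on one fiber path."""
    return lambda_count * per_lambda_gbps


if __name__ == "__main__":
    initial = aggregate_gbps(4)     # NLR at launch: 4 x 10Gb
    buildout = aggregate_gbps(40)   # NLR at buildout: 40 x 10Gb
    print(f"Initial: {initial} Gb/s, buildout: {buildout} Gb/s")
    print(f"Growth factor: {buildout // initial}x")
```

The point of the calculation is that capacity grows by adding wavelengths in parallel rather than by speeding up a single channel.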
The OptIPuter Project – Creating High Resolution Portals  Over Dedicated Optical Channels to Global Science Data NSF Large Information Technology Research Proposal Calit2 (UCSD, UCI) and UIC Lead Campuses—Larry Smarr PI Partnering Campuses: SDSC, USC, SDSU, NCSA, NW, TA&M, UvA, SARA, NASA Goddard, KISTI, AIST,  CRC(Canada), CICESE (Mexico) Engaged Industrial Partners: IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent $13.5 Million Over Five Years—Now In the Fifth Year NIH Biomedical Informatics Research Network NSF EarthScope and ORION
OptIPuter Software Architecture--a Service-Oriented Architecture Integrating Lambdas Into the Grid GTP XCP UDT LambdaStream CEP RBUDP Globus XIO GRAM GSI Source: Andrew Chien, UCSD DVC Configuration Distributed Virtual Computer (DVC) API DVC Runtime Library Distributed Applications/ Web Services Telescience Vol-a-Tile SAGE JuxtaView Visualization  Data Services LambdaRAM DVC Services DVC Core Services DVC Job Scheduling DVC Communication Resource  Identify/Acquire Namespace Management Security Management High Speed Communication Storage Services IP Lambdas Discovery  and Control PIN/PDC RobuStore
Calit2 “Lives in the Future” By Building Systems of Emerging Disruptive Technologies Co-Evolution of Personal Automobile and Highway/Petroleum Infrastructure Source: Harry Dent, The Great Boom Ahead Technologies Diffuse Into Society Following an S-Curve Calit2 Works Here
Calit2--A Systems Approach to the Future of the Internet and its Transformation of Our Society www.calit2.net Calit2 Has Assembled a Complex Social Network  of Over 350 UC San Diego & UC Irvine Faculty Working in Multidisciplinary Teams With Staff, Students, Industry, and the Community Integrating Technology Consumers and Producers Into “Living Laboratories”
Calit2 Brings Computer Scientists and Engineers  Together with Biomedical Researchers Some Areas of Concentration: Metagenomics Genomic Analysis of Organisms Evolution of Genomes Cancer Genomics Human Genomic Variation and Disease Proteomics Mitochondrial Evolution Computational Biology Information Theory and Biological Systems UC San Diego UC Irvine
Evolution is the Principle of Biological Systems: Most of Evolutionary Time Was in the Microbial World Source: Carl Woese, et al You Are Here Much of Genome Work Has Occurred in Animals
PI Larry Smarr Paul Gilna Ex. Dir. Calit2 is Now Attracting Private Foundation Grants Announced January 17, 2006--$24.5M Over Seven Years
Marine Genome Sequencing Project –  Measuring the Genetic Diversity of Ocean Microbes Sorcerer II Data Will Double Number of Proteins in GenBank!
Current Universe of  Medium/ Large Protein Families Source: Shibu Yooseph, et al. (PLOS Biology in press 2006) Protein Families Conserved Across Tree of Life  Protein Families Unique to GOS  17,067 Protein Family Clusters
 
Calit2’s Direct Access Core Architecture  Will Create Next Generation Metagenomics Server Traditional User Response Request Source: Phil Papadopoulos, SDSC, Calit2 + Web Services Sargasso Sea Data Sorcerer II Expedition (GOS) JGI Community Sequencing Project Moore Marine  Microbial Project NASA Goddard  Satellite Data Community Microbial Metagenomics Data Flat File Server Farm W E B  PORTAL Dedicated Compute Farm (1000 CPUs) TeraGrid: Cyberinfrastructure Backplane (scheduled activities, e.g. all by all comparison) (10000s of CPUs)  Data- Base Farm 10 GigE  Fabric Web (other service) Local  Cluster Local Environment Direct Access  Lambda Cnxns
The Bioinformatics Core of the Joint Center for Structural Genomics will be Housed in the Calit2@UCSD Building Extremely Thermostable -- Useful for Many  Industrial Processes (e.g. Chemical and Food)  173 Structures (122 from JCSG) Determining the Protein Structures of the Thermotoga Maritima Genome  122 T.M. Structures Solved by JCSG  (75 Unique In The PDB)   Direct Structural Coverage of 25% of the Expressed Soluble Proteins Probably Represents the Highest Structural Coverage of Any Organism Source: John Wooley, UCSD
Interactive Visualization of Thermotoga Proteins at Calit2 Source: John Wooley, Jurgen Schulze, Calit2
OptIPortal: Termination Device for the OptIPuter Global Backplane 20 Dual CPU Nodes, 20 24” Monitors, ~$50,000 1/4 Teraflop, 5 Terabyte Storage, 45 Mega Pixels--Nice PC! Scalable Adaptive Graphics Environment (SAGE) Jason Leigh, EVL-UIC Source: Phil Papadopoulos SDSC, Calit2
Calit2 is Now OptIPuter Connecting Remote  Moore-Funded Microbial Researchers NW! CICESE UW JCVI MIT SIO UCSD SDSU UIC EVL UCI OptIPortals OptIPortal
Calit2 and the Venter Institute Will Combine Telepresence with Remote Interactive Analysis Live Demonstration  of 21st Century  National-Scale  Team Science OptIPuter  Visualized  Data HDTV  Over  Lambda 25 Miles Venter Institute
Countries are Aggressively Creating Gigabit Services: Interactive Access to CAMERA and LOOKING Systems www.glif.is Created in Reykjavik, Iceland 2003 Visualization courtesy of Bob Patterson, NCSA.
New OptIPuter Driver: Gigabit Fibers on the Ocean Floor -- Controlling Sensors and HDTV Cameras Remotely National Science Foundation Is Planning a New Generation of Ocean Observatories Ocean Research Interactive Observatory Networks (ORION) Fibered Observatories Linked to Land Fiber Infrastructure Laboratory for the Ocean Observatory Knowledge Integration Grid ( LOOKING ) Building a Prototype Based on OptIPuter Technologies Plus Web/Grid Services HDTV Streams Over IP Will  be a Major Driver (Funded by NSF ITR- John Delaney, UWash, PI) LOOKING is Driven By  NEPTUNE CI Requirements Making Management  of Gigabit Flows Routine
Using the OptIPuter to Couple Data Assimilation Models to Remote Data Sources Including Biology Regional Ocean Modeling System (ROMS) http://ourocean.jpl.nasa.gov/ NASA MODIS Mean Primary Productivity for April 2001 in California Current System
Deploying Novel Infrastructure Enables New Science: Gigabit Fibers on the Ocean Floor Source: John Delaney & Deborah Kelley, UWash Canadian-U.S. Collaboration An Experiment in the NSF Laboratory for the Ocean Observatory Knowledge Integration Grid (LOOKING) ITR  Prototype of CI for NSF’s ORION
High Definition Still Frame  of Hydrothermal Vent Ecology 2.3 Km Deep  White Filamentous Bacteria on 'Pill Bug' Outer Carapace Source:  John Delaney and Research Channel,  U Washington 1 cm.
A Near Future Metagenomics Fiber Optic-Enabled Data Generator Source: John Delaney, UWash

